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Abstract 

In this paper, we study the non-bipartite maximum matching problem in the semi-streaming 
modeL The maximum matching problem in the semi-streaming model has received a significant 
amount of attention lately. While the problem has been somewhat well solved for bipartite 
graphs, the known algorithms for non-bipartite graphs use 2« passes or n~! time to compute a 
(1 — e) approximation. In this paper we provide the first FPTAS (polynomial in n, -i) for the 
problem which is efficient in both the running time and the number of passes. We also show 
that we can estimate the size of the matching in 0{^) passes using slightly superlinear space. 

To achieve both results, we use the structural properties of the matching polytope such as 
the laminarity of the tight sets and total dual integrality. The algorithms are iterative, and 
are based on the fractional packing and covering framework. However the formulations herein 
require exponentially many variables or constraints. We use laminarity, metric embeddings and 
graph sparsification to reduce the space required by the algorithms in between and across the 
iterations. This is the first use of these ideas in the semi-streaming model to solve a combinatorial 
optimization problem. 
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1 Introduction 



In this paper we consider the following question: Suppose that we are given a weighted (non- 
bipartite) graph G = {V,E,w) with \ V\ = n and \E\ = m. Can we design a FPTAS for computing 
a (1 + e) approximate maximum matching using only 0{n) additional space (not counting the 
input list of edges) such that we make a few passes over the list of edges? This question arises 
naturally in the semi-streaming model of computation. In this model we are given near linear (in 
n) space and we are allowed to make a small number of passes over a graph which is specified as an 
arbitrary (and possibly adversarial) list of edges. This is one of several natural models for processing 
massive graphs and the maximum matching problem has received a significant amount of attention 
O [m El [23l El dll [2]. For bipartite graphs the problem is somewhat well understood, based on 
a sequence of results [H [Ml El [2]. The state of the art is a 0(^ log i) pass algorithm to compute 
a (1 — e) approximation to the weighted maximum matching problem using 0{n ■ poly{^)) space 
and 0{m ■ poly{-)) time. However the status for non-bipartite graphs has been significantly more 
complicated. The results on bipartite graphs do not extend beyond a | approximation for the case 
of non-bipartite graphs. In the case of non-bipartite graphs, we need to satisfy a significantly larger 
set of constraints, which correspond to odd subsets of vertices. To achieve a (1 — e) approximation, 
the current algorithms either use 2"? passes |16] or time [2j; both of these results try to identify 
and enumerate over subsets (or paths) of size 0{-). The open question, therefore, is about designing 
an (1 — e) approximation algorithm which is efficient (in time polynomial in both n and ^, as well 
as a small number of passes). The central difficulty in designing such an algorithm is that any 
such algorithm would have to handle exponentially many subsets. In this paper we achieve such 
an algorithm using several novel ideas. We believe that the techniques developed in this paper will 
be useful in other scenarios where we wish to manage exponentially many constraints in a space 
efficient manner. 

The difference between bipartite and non-bipartite graphs has been one of the cornerstone 
results in combinatorial optimization and a very rich literature that describes the non-bipartite 
matching polytope exists (see Schrijver [2T]). The central idea of this paper is that the celebrated 
structural properties, such as the laminarity of the sets and the total dual integrality of the matching 
polytope (Cunningham and Marsh, [21]) can be used to design space and pass efficient algorithms 
for maximum matching. In particular we show that a non-adaptive perturbation to the constraints 
preserves the laminarity of odd sets (we only require this property for small size sets) in the matching 
polytope. The known proof of the Cunningham and Marsh theorem requires an adaptive two-stage 
optimization, see [21]. In that proof, in the first stage an optimum matching solution is chosen, 
and in the second stage a quadratic function is minimized (while retaining optimality). Solving 
the first step of their process in a space efficient fashion is the main challenge in our context; and 
thus the ideas in that proof do not apply directly. Intuitively, however, their proof suggests that 
we try to juggle two different objectives. We achieve this with the idea of a carefully designed 
perturbation, and a construction of outer LP and an inner LP. Both of these are solved in a space 
efficient manner. We believe this construction is of interest to the matching problem, independent 
of the semi-streaming model. 

This paper also highlights the interesting conundrum between solving the primal problem of 
constructing a matching and the dual problem of covering edges by odd sets. The latter provides 
an estimate of the former and yet fails to give a construction of the matching. It appears that the 
latter can be solved with significantly better parameters (in the number of passes), albeit using 
randomization. In fact the number of passes used for the estimation of the size of the matching 
in non-bipartite case improves upon the number of passes required to construct a matching for 
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a bipartite graph. Such a discrepancy between the packing and covering formulations was also 
observed by Onak and Rubinfeld [17j in a somewhat different setting. In the context of this 
paper, we show that the covering problem (which estimates the size of the matching) can be solved 

in p/e passes (which is independent of n) for any p > 1, using slightly superlinear (n^^'^^p^) 
space in the semi-streaming model using both graph sparsification and the Johnson-Lindenstrauss 
lemma. Thus the paper also demonstrates the use of these ideas in a combinatorial optimization 
problem in the semi-streaming model. The small number of passes can be achieved because the dual 
formulation of matching has a very small number of constraints (in comparison to the space allowed) 
but exponentially many variables, in contrast to the primal formulation which uses exponentially 
many constraints (and m variables). However, the apparent complexity this estimation algorithm 
increases in comparison to the algorithm which constructs the matching, since in the dual we wish 
to solve a subproblem with exponentially many variables in each single pass and require stronger 
certificates of progress in between different passes. We require the certificate to be sufficiently 
simple, such that we can "compress" multiple iterations of the framework using a single pass of the 
stream. Thus the matching problem illustrates a rich tradeoff of passes, running time and space, 
as a function of the structure of the problem constraints. 

Our Results: In this paper we provide two main results. 

• First, we show that we can construct a maximum weighted matching in 0{jf log ^ log n) passes 
using a deterministic algorithm that uses 0{{m-\-n^) ■poly{^)) time and 0{n ■ poly{^)) space 
(Theorem [Tj). Moreover, we show that after suitable transformations, and using a notion 
of "effective weights", the problem reduces to an adaptive sequence of weighted bipartite 
matching formulations. These bipartite matching instances can then be solved in small space 
and a small number of passes using previous results, such as in [2]. We note that the reduction 
of the non-bipartite matching to a (relaxation of) bipartite matching is likely of independent 
interest. 

• Second, we consider the problem of estimating the weight of the maximum matching (which 
uses the dual linear program) and show that we can estimate the weight in p/e passes (which 
is independent of n) using a randomized algorithm that uses 0{{m + n^) • poly{^)) time and 

n^~^^^p^ space (Theorems [23] and [32]) . Note that the number of passes is better than all 
previous results for bipartite graphs. However, the comparison is somewhat inexact because 
those previous results on bipartite graphs construct an explicit matching as well. 

Our Techniques: Both the results use (and slightly modify) the framework of approximately 
solving packing and covering linear problems of Plotkin, Shmoys, and Tardos [19j. There has been a 
long sequence of results for similar approaches [22l [Til [El [lOl 13] , but the approach of [H] explicitly 
allows side constraints specified as a polytope. We rely strongly on the ability to specify these side 
constraints and their duals. In fact one of the major contributions of this paper is to indicate which 
constraints should be added as side constraints. 

In the first result we alter the constraints of the matching polytope by adding small perturba- 
tions to each odd set constraint. We then use a strengthening of the framework of [19j, by adding a 
stronger exponential penalty than is mandated in [T9]. As a consequence we are able to show that 
the (most) violated constraints form a laminar family. This allows us to reduce the exponential 
number of constraints (corresponding to the odd sets) to a linear number of relevant constraints. 
However finding the subsets still requires a non-trivial algorithm based on the minimum odd cut - 
and moreover the perturbation and penalties have to be carefully balanced to allow the reduction 
to the minimum odd cut problem. Finally, both the perturbation and the penalty affect the rate of 
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progress (number of passes) of the algorithm. Designing this perturbation function is a contribution 
of this paper and is likely to be useful elsewhere. 

In the second result we consider the dual formulation with m constraints and exponential 
number of variables. We add a side constraint that is only true for optimum solutions which satisfy 
laminarity ~ the existence of such solutions are guaranteed by the Cunningham-Marsh Theorem. 
The covering framework iteratively provides weights to the edges (which correspond to the dual of 
the dual) and solves a subsidiary problem in each iteration. The space issue arises in both contexts. 
To provide the weights, we embed the weights into L2-distances between n vectors, and the vectors 
can be compressed in 0{n) space using the Johnson-Lindenstrauss lemma. The subsidiary problem 
now reduces to a minimum odd cut problem as a consequence of the added side constraint. However, 
the input to the subsidiary problem is still an 0{m) size (streaming) vector of weights. To solve the 
problem in small space, we use streaming graph sparsification [1] to reduce the input to 0(ne~^). 
This is akin to reducing the number of relevant constraints. This sparsified problem is solved using 
the multiplicative weights update framework described in Arora, Hazan and Kale f3]. While this 
approach appears to be convoluted, a benefit of the approach is that the number of steps of the 
multiplicative weight update algorithm can be compressed in a single sparsification step ~ and 
random sampling can be used to compress the overall number of passes to a fairly low number. 

Roadmap: We present a brief overview of the fractional packing and covering framework 
described in [19] in Section [2j We also present a brief overview of the multiplicative weight update 
method of f3]. We then present the result on constructing the weighted maximum matching in 
non-bipartite graphs in Section [3l The cardinality estimation algorithm using constant number 
of passes is presented in Section |4l Finally, in Section [5] we show how to extend the estimation 
algorithm to weighted maximum matching. 

2 Preliminaries 

In this section, we explain two frameworks for solving linear programs. In Sections 12.21 and 12.31 
we explain the frameworks for fractional packing and covering problems, respectively, as presented 
by Plotkin, Shmoys, and Tardos [20) . In Section [2.41 we explain the multiplicative weight update 
method surveyed by Arora, Hazan, and Kale [3j. These approaches share the similar high level 
ideas but have differences in details, especially in handling side constraints. 

2.1 The Matching Polytope 

Definition 1. Given a graph G = {V, E) and a matching M , let be an indicator vector of edges 
in M, i.e., yij is 1 if {i,j) G M and otherwise. Then, the matching polytope of G is a convex 
hull of for all matchings in G. 

Theorem 1. \21^ For any graph G = {y,E), the polytope defined by the following constraints is 
the matching polytope: 



Observe that the feasibility of the constraint for a set U where \U\ is even follows from the 
feasibility of the vertex constraints, since each yij is summed up twice for i,j G U . Therefore it 
suffices to focus on odd (size) sets U . Observe that the same observation holds for approximate 
feasibility, where the right hand sides of the equations corresponding to the constraints are multi- 
plied by (1 — e). From the definition of the matching polytope and Theorem [H we can construct 



Ej:(»j)e_E2/»j < 1 forallieV 
E,,,w2/u< ^ forallUCV 



(vertex constraints) 
(set constraints) 
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a linear program for the maximum matching problem with integrality gap one. ILPII and ILP2I are 
the primal and dual linear programs for the maximum weighted matching problem. The maximum 
cardinality matching problem is the case when Wij = 1 for all G E and have a similar linear 

program formulation (See ILPTl and ILP8]) . 



< 



s-t J2j y^3 < 1 ^^^^^ s.t X, + + E» JSC/ zu > V(i, j) e (LP2) 

The matching polytope is a well-studied object. One of the celebrated results on the matching 
polytope (with relation to the maximum matching problem) is the Cunningham-Marsh theorem. 

Theorem 2 (The Cunningham-Marsh Theorem [5l |2T]). // all edge weights are integers, there 
exists an optimal solution for the maximum weighted matching linear proaram kLP 1\) such that all 
Xi and zu are integers and {U\zu > 0} is laminar. 

Note that the theorem does not claim that Xi, zjj are 0/1 - which is only true when Wij = 1. This 
issue is important in solving the LP for the weighted matching case. The most accessible proof of 
laminarity that uses an adaptive two-stage optimization (See |21] . volume A, pages 440-442). The 
proof finds an optimal solution that maximizes zi/\U\'^ (among optimal solutions) and shows 
that such an optimal solution satisfies the laminarity. The integrality of the optimal solution follows 
from the laminarity. 



2.2 A Framework for Fractional Packing 

Plotkin, Shmoys, and Tardos [20J presented a framework for solving fractional packing and fractional 
covering problems. A fractional packing problem can be written as follows: 

min A 

s.t. Ax < Ab 

X e -p 

where P is a convex polytope. The algorithm assumes that min^jg^ C"^x can be computed efficiently 
for all c. 

Definition 2. The width parameter ofV is defined as p = maxj maxxep Ajx/bj. 



Algorithm 1 Framework for the fractional covering [20] 

1: Let Aq = maxi Aix/bi. 

2: Let K be a parameter with k > AX^^S^^ ln{2m5^^) and a = 

3: while maxi A^x/bi > (1 ~ S)Xo and y do not satisfy (P2) do 
4: Yi F" exp(KAix/bj) for all i. 
5: X ^ argmin^gp y^Ax. 
6: X ^ (1 — (t)x -I- crx 

7: end while 



The framework (given in Algorithm [T]) consists of multiple iterations of two-party game between 
the algorithm and an oracle. The algorithm maintains a primal candidate x and computes a dual 
candidate y (using exponential weights). Then, the oracle solves a subproblem given y(line[5|) and 
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returns the solution to the algorithm. The algorithm updates its primal candidate and proceeds to 
the next iteration. In this process, the algorithm is fixed but the oracle varies depending on the 
problem. So we need to design a problem-specific oracle in order to use the framework. 

Definition 3. J2U^ Let \^^^ he the optimal solution of the fractional packing problem. Let A = 
maxj Ajx/bj and Cp{y) = miuxep y-^Ax. x is 6-optimal if :k ^ V and A < (1 + (5)A'^^"^. 

(PI) (1 - S)Xy^h < y^Ax 

(P2) y^Ax - Cp(y) < <5(y^Ax + Ay^b) 

Lemma 3. If x and y satisfies (PI) and (P2) with 6 < 1/6, x is 65-optimal. Moreover, 

let ^ = y^b be a potential function and let ^ be the potential function after the update. Then, 
i < (1 - KCT(5A)$ < (1 - 

For any x, its corresponding y in Algorithm [1] satisfies (PI). Therefore, if (P2) is also satisfied, 
then we have a proof that x is near-optimal. If on the other hand, (P2) is not satisfied then the 
algorithm shows that the potential drops significantly. Initially, $ < mexp(KAo) and the algorithm 
terminates if $ < exp((l — 5)kXq). Combined with the above lemma, we obtain the following: 

Theorem 4. Given the initial solution with objective value Xq, we find a solution with objective 
value (1 — 5)Ao in 0{^) iterations. 

Theorem m holds even if we find x approximately, i.e., find x such that y-^Ax < (1 + 5/2)Cp(y*) + 
{S/2)Xy'^h |20j . It is also sufficient to compute y-^Ax and Ay-^b approximately (within a 1 — 6 
factor) based on the same ideas. Note that there is a slight difference between our parameters and 
the parameters in [20] which uses the smallest possible k, namely k = 4Aq ^(5~^ ln(2m5~^). We will 
use a larger value for k in Section [3l This allows us to use structural properties of the polytope 
and subsequently allows us to approximate x easily. 

2.3 A Framework for Fractional Covering Problems 

The framework for fractional covering problems is almost identical to the framework for fractional 
packing problems except two differences. First, we use y = ^ exp(— ^Ajx/bj). Note that the 
exponent is negative rather than positive. Second, (PI) and (P2) conditions are replaced by the 
following conditions. 

Definition 4. 1201 Let X = miuj Ajx/bj and Cc(y) = miUxepy'^Ax. x is 6-optimal if x ^ V 
and X > {1-6)X*. ' 

(CI) (1 + (5)Ay^b > y^Ax 

(C2) Cc{y) - y^Ax < S{y^Ax + Ay^b) 

The subroutine to find x is a maximization problem given the cost y, i.e., argmax^gp y^Ax. The 
rest of the algorithm and proofs is almost identical to the fractional packing framework. We have 
the following theorem. 

Theorem 5. 120^ If the initial solution is Qe-optimal for e < 1/12, we find a Se-optimal solution 
in 0(e^^/9log(me~^)) iterations. 
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2.4 The Multiplicative Weights Update Method 

Similar to the frameworks for the fractional packing and the fractional covering problems, the 
multiplicative weights update method [3] consists of multiple iterations of two-party game between 
the algorithm and an oracle. A (minimization) LP and its dual program can be written in the 
following conventional forms where A G W^^'^,h G M",c G R™: 

r mill b^x r max c^y 

^^•^ s.t ATx>c, x>0 UualLF.j Ay<b, y>0 

The algorithm maintains weights u* for dual constraints Ajy > bj (these weights correspond 
to a candidate for a feasible primal solution) and the oracle returns a dual witness y* or declare 
a failure, which reduces the duality gap (see the definition of admissibility below). If the oracle 
succeeds and returns a dual witness, it improves the candidate solution and if the oracle fails to 
return a dual witness, it proves that the primal solution is (near) optimal. 

Algorithm 2 The Multiphcative Weights Update Meta-Method [3j 

uj = 1 for all i E [n]. 
for i = 1 to T do 

Given w*, the oracle returns a dual witness y*. 
Let M(i,y*) = A,y*-6,. 
Assert -£ < M(i,y*) < p. 

' ul{l + e)^^''^y">/P ifM(i,y*)>0 
\^ ul{l-e)-^'-'-y'yp ifM(2,y*)<0 
7: end for 

8: Output y X)ty* (along with any correction induced by failure of the oracle). 

We have the following definition of dual witness and the corresponding theorem. 

Definition 5. We define M(i,y*) = A^y* — 6j to be a violation for Dual constraint i. The 
expected violation M(P*,y*) is the expected value o/M(i,y*) when choosing i with probability 

proportional to u\, i.e., y^"' t M(i, y*). The dual witness is defined to be admissible if it satisfies 

M(P*,y*)<(5, c'^y*>a, and M{i,y*) e [-t, p] with t < p 

p is defined as the width parameter of the oracle. The parameters e and T depend on p, £, and the 
desired error bound 5. 

Theorem 6. ^31 Let 6 > be an error parameter. Suppose that M(i,y*) G [—i,p] with £ < p. 
Then, after T = |^(") rounds and using e = min{^, for any constraint i we have: (1 — 
e)EtM(i,y*) < (5r +'^^ M(P*, y*) and thus M(i,i^jy*) < 26 (since M(P*,y*) < 6 for all t). 

3 The Maximum Matching Problem 

Our algorithm StreamMatch proceeds as follows. 

1. We first find a 6-approximation using the algorithm of [9]. We can now guess the optimal 
solution up to (1 — 5) factor using O(^) guesses. 
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2. For each guess a of the optimum solution, we formulate the problem into testing the feasibility 
of a fractional packing problem. There is a well-known formulation for the maximum matching 
problem in general graphs whose integrality gap is 1 based on odd set constraints. Using 
subsets of size at most |, the same relaxation provides a {1 — 6) approximation - however, 

the formulation is still difficult to solve efficiently (in time polynomial in n,^, and small 
number of passes) in the semi-streaming model. The following is the altered formulation 

where f{i) = —d'^i'^/i. This choice of /() requires k = 4\Q^d~^ln{2m'6'^) where m' = n^. 



min A 

J:^.,eu < ^ f ^ + f(\U\)) for all odd sets U with \U\ < i 



< 1 for all i 



(LP3) 



The width parameter of the formulation is 0(1). The function /() can be thought of as 
adding a small perturbation to each odd set. Observe that the formulation restricts the size 
of the sets (otherwise the perturbation would overwhelm the constraint). We discuss the 
details in Section [3TT1 

3. We then consider applying the fractional packing framework (discussed in Section [2.2p . The 
framework requires the computation of A which is defined over constraints. Moreover, 

the framework assigns n* weights Zfj (corresponding to the constraint U for each odd set in 
the formulation) as well. We replace zjj by z[j where {U\z'fj 7^ 0} is a laminar family. We 
then show that computing z[j (and A) exactly on this laminar family, suffices to produce a 
(1 — 0{5)) approximation oracle. In the process of computing z'jj,\, we need to solve the 
following subproblem where g{£) = — (5^£^/2. 



min > 
u ^ 

ieu 



The choice of g{) ensures that the subproblem can be reduced to a polynomial number of 
minimum odd cut problems. Moreover, we are performing a parametric search where we 
search for sets of size £ and the function g satisfies (among other conditions) that + 
f{i) — g{t) is a convex function of (. minimized at I = b. We present two algorithms: a simple 
but inefficient algorithm in Section [3.21 and an efficient but complex algorithm in Section [ 



4. Finally, we consider solving the oracle problem required by the framework. Given z'jj, the 
oracle finds y E V such that "^-j yij j z'jj is (approximately) minimized. 

(a) We now switch the constraint Yli j'^ijVij — the objective function, based on 
guessing the minimum value /3 of the objective function. 

(b) We use the Lagrangian multiplier technique and obtain the following LP (which is solved 
for several values of 7) , 

max T,^JeE ^nVn + 7(/3 - E»j V^j T,^,jeu ^c/) 

J2-i Vv ^ 1 foi' all i 



Given 7, the LP objective function rewrites to 7/3 + "^ij ywij — lYlu-i jell ^Uj Vir This is 
identical to the LP formulation of maximum weighted matching in bipartite graphs where 
we use an "effective weight" of Wij — lYliU-i j& '^'v ^ot^ ih.ai we are not saying that the 
graph is bipartite - just that this LP has a simple structure. These effective weights can be 
computed by a streaming algorithm and there is a semi-streaming algorithm for this LP [2]. 
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We enumerate all possible values of 7 (we show that only a small number of values matter) 
and solve the LPs in parallel. Based on an appropriate choice of (a linear combination of) 7 
we get j3 = "Yli j Vij J2ijeU -^'u- check if the objective function is larger than (1 — 0{5))a. 
Therefore we can solve the overall oracle approximately. This is explained in Section [3.41 

As a consequence of the above we can show the following: 

Theorem 7. There is a {1 — e)- approximation algorithm that runs in 0( ^^(log i)(log n)) passes, 
O(^) space, and 0(^(m + r)) time where r is the time for finding a minimum odd cut over 0{^) 

~ 3 

edges (t = O(^) suffices). 

Roadmap: The fractional packing framework is shown in Section 13.11 We show how to approxi- 
mate A and zu is Sections 13.21 and 13.31 We describe the Lagrangian relaxation based solution in 
Section 13.41 Finally we summarize the discussion in the proof of Theorem [T| in Section 13.51 

3.1 Fractional Packing Formulation 

We have the following LP for the maximum weighted matching in general graphs, see ILPII in 
Section 12.11 

max Y.^,o ^vV^j 

J2j < 1 for all i 

Let f{£) = -6^f/A. We alter EpH as follows: 

max ^^3V^3 

Ei y^J < 1 for all i (LP4) 

E.jea y^3 < ^ + for all odd sets U with \U\ < i 

For the rest of this section, U refers an odd set of size at most | unless mentioned otherwise. We 
show that ILP4I approximates ILPll 

Lemma 8. Let OPT be the optimal solution for \LPli There exists a feasible solution for \LP4\ with 
objective value at least (1 — 5)OPT. Moreover if there is a feasible solution for \LP4\ with objective 
value a, there is a feasible solution for \LPl\ with objective value (1 — 5)a. 

Proof. For the first part of the lemma, let y be the optimal solution for lLPll and let y'^j = {l — 5)yij. 
The objective value of y' is (1 — 5)0PT. From the definition of / and \U\ < f{\U\) > — J ^^^ 

Therefore Ei.euV'.j = (1 " E^,Jeuy^J < (1 " '^)^ < ^ + /(I^D- 

For the second part of the lemma, let y be a feasible solution for ILP4I with objective value a. 
Let y'^j = (1 — 5)yij. Then, the objective value of y' is (1 — 6)a. Since we scale down the solution, 
the constraints corresponding to nodes and odd sets of size less than | are satisfied by y'. For 
larger odd sets U, we have v'ij < h E.et/ y'^, = ¥ T.^^u Vi^ < < ° 

Now we formulate the problem as a fractional packing problem, which is 1LP31 The corresponding 
oracle is: 

p f y^3 < 1 for all i (LPS) 
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3.2 A Simple Algorithm for Estimating zjj and A 

The difficulty in solving ILP5I arises from the fact that the number of variables zjj is large. There 
are n®*-^-* odd sets of size at most ^. So even computing all zjj would take exponential time in ^. 
Fortunately, we only need an approximation solution for ILP5I and most of zu contribute little to 
the objective value. So we replace zjj by z'^j which are defined as follows: 

Definition 6. For odd sets U with 1 < \U\ < ^, let Yu = Y^ijeuVij' = + f{\U\), 

Xu = Yu /hjj . Let A = mmu A{/. Let 




otherwise 



and L = {U : z'uj^ 0}. 

In the rest of this section, we present an algorithm to compute A and z'fj exactly. Lemma [9] shows 
that we can approximate ILPSl using z'jj. Lemma [10] means that either we have a (near) optimal 
solution OT L = {U\z'jj > 0} is a laminar family. If L is a laminar family, all non-zero z'jj values 
can be stored in 0{n) space, across all the iterations. Based on these two lemmas, we prove 
Theorem [TT] which reduces to computing the minimum odd cut - however we need to parametrize 
the minimization to explicitly avoid large sets. For the sake of continuity of the discussion we first 
state the lemmas and the theorem we prove, and provide their proofs subsequently. 

Lemma 9. For any y' E V, v'ij T.i,j&u approximates Y,ij v'ij J2i,j(^u within ((5/4)A Y,u ^ubu 
additive error. In addition, ^Y^jj z'fjbjj approximates \ zubu within (1 — 5) factor. 

Lemma 10. // A > 1 + 25, L is a laminar family for a sufficiently small 5. 

Theorem 11. There is an algorithm that finds A and L in polynomial time and near linear space. 
Proof of Lemma |9] 

As mentioned in Section [3.11, the parameter k was set as k = 4Aq ^5^^ ln(2m'(5~^) in |20j . where m! 
is the number of constraints. Note that m' = n^^s^ for the matching formulation we have. 

We use K = 4Ao ^(^~^ln(2m'5"^). Intuitively, we increase the cost of constraints with large 
violation so that only a smaller number of constraints matter. However, the larger value of k will 
increase the number of iterations. 

Proof of LemmalB Let $ = XJ^u^ubu- Then, \ Y.u ^ubu > A^^eM). For U with Ac/ < A - 25^ 
[z'^ = 0), we have zu < 2exp(K(A(7 - A) + kX) < 4exp(-K5^)$ < 4(n~7)^<I>. Therefore: 

J2 y'v ^u-Y. y'^j H ^'u ^ y'^j Y - z'u) <Y^^u- z'u) J2 ^\ Y 

ij iJ^U ij hj£U ij i,j€iU U i,j€iU U:z[j=0 

6 4 
For X^jj z'jjbu , we have a similar inequality; namely 

X'^zubu - X'^z'jjbu < X zubu<2^ 

Therefore the lemma follows. □ 
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Proof of Lemma [TOl 

Now we can focus on U with Xu > X — 5^ and show that such odd sets form a laminar family. 
The intuition behind the proof of the laminarity (and the choice of f{i)) is from a proof of the 
Cunningham-Marsh theorem (See [21], volume A, page 440-442). For the sake of the proof, we 
extend the definition of Yjy, bu, and Xjj (see Definition [6|) to odd sets of size at most |. 

Proof of Lemma \1(A Suppose that A > 1 + 26 and L is not a laminar family. Then, there are 
A,B & L such that ACi B ^ ^, A, B. There are two cases depending on |^ n 

Case I: \AnB\ is even. Let Qa = EieAns EjeA-B Vij and Qb = Yli&AnB T.jaB-A Vij- Without 
loss of generality, assume that Qa ^ Qb- Let C = A — B and D = Af] B. Then, 



Yc = Ya-Qa- Yd and < i ^ y,, - - Qs < ^ 



D\ Qa + Qb 



2 2 



Since 6^ = ^ + f{\A\), be = + f{\C\), \D\ = \A\ - |C7|, we have 

Yc>.YA-\^-^ + f>_YA^^^ 

bA-bc = ^^ + /(|A|)-i^^-/(|C|) = ^tM + (/(|A|)-/(|C|)) 
= M + ^l{\A\ + \C\)i\A\-\C\)>{l-S)\^ 



For the last inequality, we used the fact that |^| + |C| < 



<5- 



Xcbc > ^AbA - (1 - S)-\bA - be) > \Abc + (Aa - (1 - 5)-^){bA - be) > \Abc + ^{bA - be)- 

The last inequality is from the assumption that A^ > X — 6^ > 1 + 26 — 6^. Since be < ^ and 
bA — be > 1 — S, we get Xc > Xa + ■ But this contradicts the assumption that Xa > X — 5^ since 
A > Ac. 

Case II: \AnB\ is odd. Let C = An B and D = AU B. Let t = {\A\ + \B\)/2 = (|C| + \D\)/2, 
p = \\A\ — t\, and (7 = ||C| — t|. Then, q > p + 2 and we have 

Since bA + bB< |, be + bo < {I - 5^){bA + be). Wlog, we assume that Xa < A^- Then, 

Xcbc + XDbD ^Yc + Yd^Ya + Yb^ XaBa + XsbB > Aa(6a + bn) 

If both Ac and A/j is less than Xa + S'^, 

Xcbc + XobD < {Xa + S^){bc + bD) <Xa{1 + 6^){l - S^){bA + &s) < Aa(6a + be). 

So Ac or Ad is greater than A^ + which means that either X > Xa + or Xd > Xa + 5^ with 
\D\ > |. The former contradicts the assumption that Xa > X — 6^. In the latter case, we have 
Yd < \D\/2 and 
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Hence, Xd = Y£)/b£) < 1 + 25 and Xa < Xd — 5"^ < I + 26 — 6^ which contradicts the assumption 
that A > 1 + 25 and Xa > A — 5^. From the above two cases, L is a laminar family if A > 1 + 26. □ 

Since L forms a laminar family, we can store L in 0{n) space. In addition, since \U\ is bounded 
by |, the computation for X^jjet/'^f^ given an edge can be performed in 0(|) time. 

Proof of Theorem [TT] 

Finally, we present algorithms to compute A and z'jj. We use an auxiliary function F{X*,£,U) to 
reduce the problem into the minimum odd cut problem. F{X* U) is defined as follows. 

Definition 7. Let g{i) = and h{£) = Let 

F{X*,£, U) = J2 

leu 



X* 1 



E 



EE 



Algorithm 3 Find sxgmhijj F{X* ,£,U) 
1: If \V\ is even, add a node to V without any adjacent edge so that \V\ is odd. 
2: Construct a graph G" = U {s}, £", w) such that E' contains 

(i) with Wij = y.y and 

(ii) [i, s) with ^ X* {l - - yij- 

3: Find a minimum odd cut ([/, U {s} — U). 
4: return t/ 



Algorithm[3]returns an odd set U that minimizes F{X* U). In fact, the cut value of ([/, VU{s]—U) 
is exactly F{X* ,£,U). We repeatedly use Algorithm [3] for finding an odd set U such that Xu > A* 
and \U\ = £. F{X* ,£,U) satisfies the following three properties. Lemma [T2l shows that if there is 
such an odd set. Algorithm [3] finds an odd set. Lemma [13] and Lemma [HI shows that we always 
find a set U with \U\ = £ (given A* close to A > 1 + 25). 

Lemma 12. If there exists a set U with \U\ = £ and Xu > X* , F{X*,£,U) < A*(l - 2h{£)). 

Proof. From the definition of A[/, we have 2yjj = Xi/{\U\ — 1+2 f{£)). Since A* is less than Ac/, 

we have J2i,jeu '^Vii >X*{\U\-l + 2f{£)). Applying Y.i,j&u '^V^i = J2ieu Ej Vij - J2ieu T.jiu Vij 
and f{£) = g{£) + h{£), we obtain the desired result. □ 

Lemma 13. If \U\ ^ £ and X* >Xu-6^, F{X*,£,U) > X* {I - 2h{£)) . 

Proof We haveF(A*, A U) - A*(l - 2h{£)) = X*{\U\ - 1 + \U\^ + 2h{£)) - Y.i^j^v'^Vir 

Let fu{£) = \\J\^ + h{£). Then, fu{£) = ^{£^ - 2\U\£) which is a convex function minimized at 

1=\IJ\. In addition, fu(\lJ\) = f{\U\) and fuii) > fi\U\) + 26^ for £ ^ \U\. So 

\U\-1 + \U\^ + 2h{t) >\U\-l + 2J{\U\) + A5^ > (1 + A5^){\U\ - 1 + 2f{\U\)) = (1 + 5^)2hu 
Also je(7 ^ '^^ubu- From X* > Xu - 6^ > Ac/(1 - 6^), we obtain 

F{X*,e,U) - A*(l + 2h{£)) > (1 - S^){1 + A6^)2Xubu - 2Xubu = (3(5^ - 4S'^)2Xubu > 
which is the desired result. □ 
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Lemma 14. If \U\ > } and X* > I + 25 - 5^, F{X*,£,U) > A*(l - 2h{£)). 

Proof. \U\^ + 2h{i) is a convex function that is minimized at i = \U\. Hence, it is minimized at 
£ = 1 if £ < 1 and |C/| > ^. So we have 

\U\-1 + \U\^ + 2h{£) = \U\-1- \U\£S^ + 
On the other hand, j,zu2yij <\U\. Therefore, 



F(A*,£, U) - A*(l - 2h{l)) = \*{\U\ - 1 + |C/| 



2 >\U\-l-6\U\ + \>\U\{l-l5). 



■2hii))- 2y,,>X*\U\il--d)-\U\ 



The RHS of the above equation is > ((1 + 25- 6^){1 - - 1)\U\ > 0. 



□ 



Using the properties of F{X* ,i, U), Algorithm [4] and Algorithm [5] computes A and L by enumerating 
all possible values of i. 



A* ^ 1 + 2(5. 
for ^ = 3 to i do 

Find argminy F{X*,£, U) in G. 
if Xu > X* then 
A* 4- Xu 
Repeat [3] 
end if 
end for 
return A* . 



Algorithm 5 Find L given y and A. 

1 
2 
3 



Let A* = X-S'^. 
for ^ = 3 to i do 
G' ^ G. 

Find argmin^ F{X*,e, U) in G". 
5: if |C/| = ^ and A,7 > A- then 
6: L^LU{U}. 

7: Delete all nodes in U (and adjacent edges) from G". 

8: Repeat H 

9: end if 

10: end for 

11: return L 



Proof of Theorem\ll\ First, we prove that Algorithm [H finds A with O(^) minimum odd cut prob- 
lems. Let = argmax A;/. By Lemma [T2l if Xui, > A*, we find at least one U with Xu > X* . 
Therefore, Algorithm H] finds A eventually. In addition, we find U with \U\ = i only if Xu = Xu^ 
So for each odd set size i, we update A* at most once. Therefore, we compute a minimum odd set 
at most I times. 

Algorithm [5] finds all U with A(7 > A — (5^ in polynomial time. If there is an odd set U £ L with 
\U\ = i, we find an odd set by Lemma [12] and the size of the set is £ by Lemma [13] and Lemma [14] 
So if |C/| 7^ i, no set of size £ in L is left in G. Therefore, we find all sets in L by enumerating all 
possible values of £. The number of times we need to solve the minimum odd set is 0{n + ^) which 
is the desired result. □ 



3.3 An Efficient Algorithm for Estimating zu 

The bottleneck on the running time to compute A, z[j is Algorithm [5] Algorithm [5] repeatedly finds 
argmin^ F{X* ,£, U) which solves a minimum odd cut. In Algorithm[5l we find one odd set at a time 
which causes 0(n) executions of the minimum odd cut algorithm. Padberg and Rao's minimum 
odd cut algorithm constructs a Gomory-Hu tree [18]. Instead of throwing out the Gomory-Hu tree 
after finding one odd set, we can reuse the Gomory-Hu tree using the laminarity of L and find 
multiple odd sets from the tree. Algorithm [6] is the improved version of Algorithm [5] We prove 
that the algorithm finds at least half oi U £ L with |C/| = ^ per iteration. 
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Algorithm 6 Find L given y and A. 



Let A* = A-^^ 

for ^ = 3 to i do 
G={V,E)^G. 
repeat 

If \V\ is even, add a node to V without any adjacent edge so that \V\ is odd. 
Construct a graph G' — {V {s},E' ,w) such that E' contains 

(i) with Wij = y^j and 

(ii) (z, s) with = A* (l - _ y^J■ 
Construct a Gomory-Hu tree of G". 

for each edge e in the Gomory-Hu tree with We < A*(l — 2h{£)) do 
Let ([/, V U {s} — U) be the cut obtained by erasing e from the tree, 
if |C/| = ^ then 
L^LU{U}. 

Delete all nodes in U (and adjacent edges) from G. 
end if 
end for 
until No U is added to L 
return L 
end for 



Lemma 15. Let = {U\U £ L,\U\ = £} . In Algorithmic for each iteration of line[^{ U^ we find 
at least half of remaining sets in Li. 

Proof. We use the proof of the minimum odd cut algorithm in [21]. For the details of the proof, 
see [21], volume A, page 499. Let F be the edge set of the Gomory-Hu tree. For f G F, let Wf 
be one of the two components obtained by erasing / from the Gomory-Hu tree. Let 5f{U) be the 
edges in L which are cut by {U, V U {s} — U). 

Suppose that U £ Lf. Then, there must be / = (u, v) € 6f{U) such that Wf is an odd set (from 
[21j). Wf is a minimum u — v cut and hence, the cut value is less than or equal to U. By Lemma [13] 
and Lemma [131 only odd sets of size i has cut value less than U. Therefore, Wf or its complement 
is in Li. Let this odd set he U' . li U ^ U' , Algorithm [6] does not find U but does find U' . 
Furthermore, the only way to find U' is by choosing /. So U' does not eliminate other sets in 
Li. Therefore, the algorithm covers at least half of remaining U G Lg per each iteration of line 

UM □ 

By repeating O(logn) times, we find all sets in Li. By enumerating over all possible i values, we 
obtain the following theorem which is Theorem [11] with the improved running time of Algorithm [6] 

Theorem 16. There is an algorithm that finds L in 0{j) time where r is the time to construct 
a Gomory-Hu tree for an n node graph with 0{n ■ poly{^)) edges (which is n times the number of 
passes). 

3.4 Solving the Oracle with the Lagrangian Dual 

Given z'jj, the oracle (approximately) decides if there exists y G "P such that Vij ^'u ^^^^ 
than (1 - 5) J2^j Vij ^'u " T.u ^'u^u- 
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Let 13 = {1 — 5) J2ij Uij Ylij ^'u ~ X^(7 ^'u^u- Then, we can reduce the oracle ILP5I into the 
fohowing hnear program - note that we have switched the objective function and the constraint since 
we guessed the objective function (as discussed earher in Section Note that we are interested 
in the solution of the above only if the new objective function is at least (1 — 0{5))a . 

max w^jy^j 

Y,j Vij < 1 for all i 



Taking the Lagrangian relaxation of the final constraint, we obtain the following linear program. 

(LP6) 



J2i Vij ^ 1 fo'' all i 



Observe that the objective function of ILP6I rearranges to 7/3 + Yli j y^ij ~ ^Ylu-i j&u ^uj Vij- 
Note that (3^ > and therefore (approximating) ILP6I is identical to the linear program for the 
bipartite maximum weighted matching problem except 7/3 in the objective value. We presented an 
approximation algorithm for the bipartite maximum weighted matching problem [2]. 

Theorem 17. There is a (1 — 6) -approximation algorithm for the bipartite maximum weighted 
matching problem that runs in 0(5^^ log 5^^) passes, 0{n) space, and 0{m) time. 

Corollary 18. We can find a {1 — 6) approximation for ILPdi using 0{6^'^ log 6^^) passes, 0{n) 
space, and 0{m) time; where on seeing an edge {i,j) we compute its "effective" weight to be 
'^ij ~ lYlu-i jGU -^'u ^'^'^ ^'^^ maximum weighted bipartite matching algorithm. Note that we 
discard all edges with negative effective weight. 

We use an approach similar to the solution of the linear programming relaxation of the /c-median 
problem, as in Jain and Vazirani [14j. We enumerate possible values of 7 and approximate ILP61 By 
taking a linear combination of two of them, we find an approximation solution for ILP6I Algorithm 
[3 is the approximation algorithm for ILP6I 

Definition 8. Let OPT be the optimal solution of \LP6l Let qij = j^u z'jj. 
Algorithm 7 Find y 



for fc = to [|] + 18 in parallel do 

T Sak 

Let 7 = — . 

Find a (1 — (5)-approxiniation given 7 for ILP6I given 7. 

Let y'^ be the approximation solution and LP{'y) be the objective value of y-y. 
end for 

if E.J yi;%. </3 then 

return y". 
else 

Find 7 and 7' = 7 + ^ such that J2tj vljQ^j > P and Y.^j vlj Qtj < 
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10: Let a 

El 7 
ij ^i-3 ^-^'ij 

11: return ay'' + (1 — a)y'' 
12: end if 



Lemma 19. //Ej Vij < 1 for all i, Y,ij WijUij = 9a. 
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Proof. Let M be the size of the maximum matching. We initially find a 6-approximation for M. 
The size initial matching is the lowerbound of a. On the other hand, the integrality gap is 3/2 
which means the • WijUij < |M. Therefore ■ WijUij = 0{a). □ 



^l] —i-jaij — -2,'— ■ 

Lemma 20. Suppose that 7 > (^+^^)(^") yi ig ^fi^ corresponding fractional matching returned 



by the bipartite matching algorithm. Then, YlijUljlij — 

Proof If 7 > iiif^dHifi^ then we can obtain a solution with objective value at least (1 + 26){9a) 
by assigning to all variables. If J2ij vljlij > /^j the objective value is less than ^ WijyJ^ < 9a 
by Lemma [T9l This contradicts the fact that y"^ is a (1 — (5)-approximation given 7. Therefore, 

Lemma 21. Algorithm^ returns y such that yij > (1 — 76)OPT and '^■j yijQij < f3. 

Proof. It is obvious that LP('y) > (1 — 6)0PT because we find a (1 — 5)-approximation for each 7. 
If Y^ij VijQij < the algorithm returns y° and > (1 - 5)0PT. If ^ > /3, there must 

exist 7 and 7' = 7 + ^ such that Yliij vlj > P Z^ij 2/^^- ^ by Lemma [20l By the construction 
of y, Eij y^3<lii = Then, 

(1 - 5)OPT < aLP{j) + (1 - a)LP{-f) 



+ (!-«) 



+ {l-a) 



Y.y. 

ij 

T.y^ 



■iiP-T.yti^=' 

ij 



5a 



< y^j + 7(/3 - 51 y^j^^i^ + ~rI^ ^ H y^^ + 



/3 

Since a < 60PT, Y.ij WijVij > (1 - 76)0 PT. 



□ 



Hence, if q is less than (1 — 75) fraction of the maximum matching, the oracle finds a y G P such 
that improves the solution significantly. Combining all the components, we obtain a (1 — 0{5))- 
approximation of the maximum weighted matching. Summarizing the entire discussion, we obtain 
the proof of Theorem [71 



3.5 Proof of Theorem [7] 

Proof. The width parameter of lLPSl is 0(1) and the number of constraints is 0(n~ ). Combined with 
our choice of k, the total number of iterations for the fractional packing framework is 0{-^ logn). 
For each iteration, we compute A and z[j which takes 0{^) time where r is the time for constructing 
a Gomory-Hu tree. (Using the preflow-push algorithm [13], it takes 0{n?m') time for constructing 

~ ~ 3 

a Gomory-Hu tree where m' is the number of edges. In our case, m' = 0{^) and hence, r = O(^) 
suffices.) For lLPGl it takes O(^logi) passes and O(^) time. Since we execute the algorithm for 
0(7) guesses of the optimal solution in parallel, we obtain the desired number of passes, space, and 
time. Note that the space is dominated by the total number of edges chosen by the oracle. Each 
oracle execution returns at most O(^) edges and therefore, the required space is O(^). □ 
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4 Estimating the Size of the Maximum CardinaUty Matching 



In this section, we present an algorithm that approximates the size of the maximum cardinahty 
matching problem (MCM) in 0(2) passes, 0(n^+^/P) space and polynomial time. The algorithm 
solves the dual linear program of MCM in general graphs and only approximates the size of the 
optimal matching. ILP7I and ILP8I are the primal and dual program for MCM in general graphs. 
Note that ILPTl is equivalent to lLPlI with Wij = 1 for all 



max 

s 



t < 1 s.t + X, + E.j^c/ > 1 v(i,i)e£; (lps) 

V „..< Ml WT x„zu>0 

2^{i,j)eu Vv ^ [ 2 \ 

y^l > 

We first use the following simple algorithm SimStreamMatchEst and subsequently show how to 
improve the number of passes taken by the algorithm. 

Algorithm: SimStreamMatchEst 

1. As in Section [STTl we bound the size of odd sets to |. The resulting linear program (1 — 5)- 
approximates iLPTl and has at most n's constraints. 

2. We formulate the dual linear program as a fractional covering problem with a small width 
parameter. 



max A 

s.t X, + Xj + Ezjec/ zu>X y{i, j) gE 

, I |c/| I ^ IJ^^y; 



In particular, we use the Cunningham-Marsh theorem to show that we can add 2xi + 
'^Zi^u < 3 to ILP8I without changing the optimal objective value. Note that j^u zjj < 
X^igf/ Zfj and hence the width parameter is 4. 

3. The oracle for ILP9I solves the following problem given yij. ILPlOl is the oracle and ILPlTl is its 
dual LP. 



max Y.i Ej y^J + E(7 Y.i.jeu V^j Ez 30 + Cc 

(LPIO) (LPU) 

If we have an oracle that approximates the above subproblem ()LP lOp in near linear space 
and polynomial time, we can construct an algorithm for ILP8I using the fractional covering 
framework of |20] . However achieving such an oracle is nontrivial and we describe how to 
design an approximate oracle in the next few steps. 

4. To solve the oracle problem given in ILPlOl we observe that the bottleneck arises in two 
steps. First, the weights yij cannot be stored (since they are m in number) and therefore 
the objective function of ILPIOI itself is an issue. Second, the LP has to be solved in a space 
efficient manner. Third, the solution has to be stored in a space efficient manner such that 
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the YlijeU-^u can be computed when we generate yij (for the next iteration). We address 
these issues as follows: 

(a) First, we sparsify yij so that 0{n) edges has non-zero values while approximating the 
optimal solution. This can be done with a semi-streaming sparsification algorithm [1]. 
We store the sparsification and therefore have random access to it. 

(b) We then solve the ILPlOl with the multiplicative weights updated method In each 
iteration, we find a violated constraint in ILPllI and construct a dual witness using the 
constraint. We can also use the ellipsoid method to solve ILP 1 II using violated constraints 
(however we need to be careful about the next step). 

(c) However, we need a near linear size sketch of a solution for ILPIOI and use the Johnson- 
Lindenstrauss Lemma to sketch Y^i j^u zu. Note that we keep updating this sketch as 
we keep on generating U, zjj since storing these sets till the end of the oracle need not 
be space efficient. 

Combining the above components, we obtain the following theorem. 

Theorem 22. The semi- streaming algorithm SimStreamMatchEst approximates the size of the 
maximum cardinality matching to a (1 — e) factor in 0(^^^) passes, 0{^) space, and O(^) time 

" ~ 3 

where r is the time to compute minimum odd cut over a graph with ©(p-) edges (t = O(p-) 
suffices). 

Space Pass Tradeoffs and Constant Number of Passes: The algorithm SimStreamMatchEst 
uses a number of passes which depends on n. In Sect ion H?6] we describe a method StreamMatchEst 
to process multiple iterations (of the overall framework) in one pass. 

Theorem 23. The semi- streaming algorithm StreamMatchEst approximates the size of the 
maximum cardinality matching to a (1 — e) factor in O(-) passes, 0( "^ti'^^ ) space, and O(^) 
time where r is the time to compute minimum odd cut over a graph with O(^) edges (t = O(^) 
suffices). 

Roadmap: In Section [4. II we show that the addition of the constraint does not alter the problem. 
In Section 14.21 we prove that an approximation solution given the sparsification of yij is an approx- 
imation solution of ILPlOl given y^j. In Section [4.31 we describe a polynomial time algorithm that 
finds a violated constraint. In Section [4.41 we present an algorithm that sketches zjj', the algorithm 
produces a sketch that estimates Yli ji^u for all pairs. Theorem [22] is proved in Section [^31 
In Section [46l we describe the pass space tradeoffs and prove Theorem [23l 

4.1 The Fractional Covering Formulation 

We use the Cunningham-Marsh Theorem (Section 12. 1| Theorem [2] to show that we can add 2xi + 
Yli(^u ^c/ ^ 3 to ILP8I without changing the optimal objective value. This is based on the following 
straightforward corollary (for Wij = 1 only). 

Corollary 24. There exists an optimal solution for \LP^ such that all Xi and zjj are or 1 and 
U with zu = 1 are disjoint sets. As a consequence we can add 2xi -\- X^jgj; zu < 3 to \LPS\ without 
changing the optimal objective value. 
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Proof. If Xi or zu is 1, it satisfies all the constraints that constrains the variable. So it is obvious 
that Xi and Zu are or 1 in an optimal integral solution. If Zf/ = 1, it satisfies all the constraints 
that correspond to € U' for U' C U. Hence, zjji = for all [/' C [/ in the integral optimal 

solution. Therefore, if {U\zu = 1} is laminar in an optimal solution, such sets are disjoint. 

The feasibility of the additional constraint follows immediately. □ 



4.2 Sparsification of jij, the input to the Oracle 

In each pass, we have a candidate solution for ILP9I and compute yij based on the candidate. 
However we cannot store these yij values. We use the following result: 

Theorem 25. Jl^ There exists a single pass semi-streaming algorithm that returns a sparsification 
of size 0{n/e^) that estimates all cut values within (1 it e) factor. 

We generate a stream of yij as we stream through the edges, and construct a sparsification ijij 

such that for any C/, (1 - VC) EiGl/,j^t/ ^ T^i^uMuVij < (1 + V*^) EieC/j^c/ wher e C is 
a constant to be defined later. We prove two lemmas which show that we can approximate ILPIOI 
given ijij instead of yij. 



Lemma 26. Suppose that (x, z) is an optimal solution o f \LP10\ given yij and its objective value is 
j3. Then, the objective value of (x, z) given yij is at least (1 — 6/2)f3. 

Proof. If Yliij^uy^i ^ zYliieuYl/jVij^ = 0. Suppose that it is not true. Then, we can increase 
Xi by ^zjj for i G [/ and set zjj = 0. Then, left hand side of each constraints in ILPlOl decreases and 
therefore the modified solution is still feasible. On the other hand, the objective value increases by 



zu 



> 0. 



Now consider the case Y^ij^u yij ^ i T^ieU Y.j yij^ we have: 

1 



J2 y^i 

1,0 &u 



ieu j ieu j^u 



> 



ieu j 



{i + s/c)j2J2y^i 

ieu j^u 



6 



o \ ^ \ ^ \ ^ 4(5 X ^ 



c 



C 



Vrj- 



For C > 8, we have Y.i,j(^uyij > (1 " ^/'^)T.i,jeu Vij and J2jyij > (1 - ^/'^)J2jyij for ah i. 
Therefore, the objective value of (x, z) given yij is at least (1 — 5/2)/3. □ 



Lemma 27. Suppose that (x, z) is a solution of \LP10{ with the objective value less than (1 — 5)f3. 
Then, the objective value of (x, z) given ijij is at most (1 — 5/2)13. 

Proof. Suppose that the objective value of (x, z) given y^j is (1 — q)(3. From the proof of Lemma 
23 if Yji,j(iU yij < i YjidU Yjj yij^ we can increase the objective value by at least zuj2 Y^iau Yij yij 
by modifying values of x, z. Therefore, Yu ^Uj2Yi£uYj yij — Qf^ fo^ such sets U. From the 
sparsification. 



J2 y^^ 

t,jeu 



< 



J2Y.y^i-Y.Y.y^i 

ieu j ieu j(fu 

J2J2y^i-Y.J2y^^ 

ieu j ieu j^u 



< 



(1 + 5/c) J2 E - (1 - '^/c') E E 

ieu j ieUj^u 



EE 

ieu j 



Vij 



E y^i 

i,jeu 



C 



ieu j 



Vij- 
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So the error term derived by such U is at most If Zlijei/ Vij ^ i SieC/ X^j ^^i' T^i,jeU Vij - 

{1 + A6/C) j^jj Uij . We also have Ylj Vij ^ (1 + ^/C*) yij • From these three inequahties, the 
objective value of (x, z) given ijij is at most (1 + 4(5/C)(l — g)/3 + {125q/C)j3. For a sufficiently 
large C and g > 5, the value is at most (1 — 5/2)j3. □ 

4.3 Solving the Oracle 

In this section we discuss how to solve the oracle described in ILPlOl We will be using the multi- 
plicative weights update method (see Section [2.4p . This method solves the dual problem explicitly 
and produces a certificate of optimality for the primal. Therefore we will apply the method to the 
dual of ILPlOl and have an explicit solution of the dual of the dual of ILPIOI (which is ILPlOp . The 
dual of lLPlOl is described in lLPTTl 

min Y.i + Ca 
s.t 2C, + iCa>Ejy.j (Type-A) 
l:^euC^+V%^Q.>T.^.,^uy^3 VC/ (Type-B) 

If the number of nodes in the graph is even, we first add a node with no edges adjacent to it 
and use the multiplicative weights update method. We will explain the reason why we add the 
node later. In each iteration of the multiplicative weights update method, we are given weights C,i 
for each node i and Ca which correspond to the constraints of ILPlOl We find a violated constraint 
of ILPlll and construct a dual witness with it. Algorithm [8] is the oracle for ILPlll 

Algorithm 8 Oracle for ILPlll 
1: Normalize C,i and Ca so that 3Ci + Ca = /3- 

2: if there exists a node i such that 2C,i + < J2j Vij ^^"^ J2j Vij > 5(3 /a then 
3: Return xi = (3/ yij(and for other variables). 
4: end if 

5: Co ^ Ca + 5j3. (Ensures that all type-A constraints are satisfied; since we can have J^jVij — S/S/a 
above.) 

6: Ca ^ (1 + 2(5)Cq. (Ensures that U with \U\ > | is not used in line ITT]) 
7: Add a node s to the graph where an edge (i, s) has weight ^{2Q — ^Ca ~ J2j Vij)- 
8: Find a minimum odd cut {S, U) with s G S. Note that both S and U are odd sets. 
9: if the minimum cut value is less than {(a — Sf3)/a then 
10: Let A = ^,^ ^.gyy,,. 

11: Return zjj = /3/A(and for other variables). 
12: else 

13: The oracle reports failure. We indicate the verification step: Increase Cq by 6(3 and return {{Ci},Ca) 

as the overall primal solution. 
14: end if 



Lemma 28. Algorithmic returns o,n admissible solution with ^ = 2> and p = 0{n/5). 

Proof. It is obvious that c-^y = /3 from line [3] and line [TTJ For the other conditions of admissibility, 
we show that we only pick a constraint where the right hand side is at least It is obvious in the 
case of type-A constraints. If there is no violated type-A constraint whose RHS is at least 5 (3 /a, 
we satisfy all type-A constraints in line [5] by increasing • 
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For an odd set U , we have 

EC. 4^ 



= ^E 2c. + ^c.-E2^'0 

The first term is the weight of an edge between s and i G U and the second term is the cut value of 
U excluding edges to s. So if the cut value of U is less than ^Co ~ the constraint corresponding 
to U is violated by at least Since the left hand side of a constraint is always positive, this proves 
that the right hand side is at least — . 

In addition, if |?7| > |, the corresponding cut value is greater than {(a — 6(3) /a. Let ('^ and Ca 
be the value of Ca before and after line [U] since for |{7| > |, 



E 2c. + ^c«-Ey^^UEEy'^-^ 



1 + 26 ^ l^Y-Y- i±2^^ 



a 



a 



Ef^o + ^c-E^/.) 

26\U\ 1 + 25 

+ 2^2^ y^i + — — C - — — C 



> 0. 



So we assign at most j to Xi or Z(/ (with < ^) and the values are all positive. Also only 
one variable has positive value. For we have M(z, y*) = 2xj + J2ieU — and for Qa, we have 

M(a,y*) = ^ [E.^;^ + Ec/L^J^c/] - 1- Then, -3 < M(i,y*) < 2a/,5 and -1 < M(Q,y*) < 6n/2 
Since we only choose a violated constraint, we have the following inequality. 

1 



M(I?*,y*) 



< 









1 




^E.O 




1 




f E.C. 




1 




f E.O 




1 




f E.O 



CaM(a,y*) + EOM(i,y*) 



Ca 



E^' + E 



M 

2 



c. E + E - 3 



IC/I 



E--^ E 



Vij 



E I a^i + + E '^^\ ~ ^ 



E I E y^o 1 ~ /3 



□ 



Lemma 29. If the oracle reports failure, it returns a feasible solution for \LPll\ with value at most 
(1 + 56) f3 is returned. 
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Proof. If a type-A constraint is violated more than 6P/a, the oracle returns a dual witness. Other 
constraints are satisfied by line [5] in which the objective value is increased by 6f3. In line EJ we 
increase the objective value by at most (1 + 26) factor. After that, if a type-B constraint is violated 
more than 6f3/a, the oracle returns a dual witness. If not, the constraints are satisfied by line [13] 
and the objective value is increased by 6/3. Initially the objective value is /3. So the oracle returns 
a feasible solution for ILP 111 with value at most (1 + 2(5)(1 + 5)f3 + 6(3 < (1 + 56)13 (for sufficiently 
small 6). □ 

4.4 Storing zu and Estimating 'Ylii'j<^u^u (to compute yjj) 

From Lemma [28l Lemma [29l and Theorem^ we can find a (1 — (5)-approximation of lLPlOl in O(^) 
iterations. However, the size of the output for ILPIOI can be superlinear in the number of nodes. 

Note that the analysis in Section [4.31 requires the estimation of 'Ylii ji^u with 5 additive error 
not the exact value of zu for all U . In this section, we present an algorithm that produces a 
sketch of zu that approximates Yliji^u^u within 6 additive error for all i,j pairs. We use the 
Johnson-Lindenstrauss lemma [15]. Especially we use the fact that we can construct a random 
linear mapping for the Johnson-Lindenstrauss lemma obliviously [6]. 

Theorem 30. Suppose that 0<e<l,nisa integer, and k = [-^logn] with a constant C. 
Let V he a set of n points in and let A be a d x k matrix such that Aij ~ -^N(0, 1). Then, 

(1 — e)||M — tip < ll^u — AvW^ < (1 + £)\\u — f IP for all u,v G V with high probability. 

Note that the construction of A is oblivious to V, i.e., we can construct A without reading V. 
Let C/i, C/2) • ■ ■ be the sets we chose at time t and zt be the weight we give to Ut. Let Xi be a 
vector such that 

X ^ I ^ if « e C/t 
1^ otherwise 

Then Yli j&u = iW^iW^ + W^jW^ — W^i — ^jW^)/"^- For each i, we store AXi instead of Xi and 
A can be generated column by column as we proceed. Then, we estimate and \\Xi — XjW^ 

within 1 lb 0{6) factor. Since = Ylii^u ^ 3, the multiplicative error translates into 0{6) 

additive error in H-^^jp + — \\Xi — XjW^. The total space required is O(p-logn). 

4.5 Proof of Theorem [22] 

Proof. The sparsification step requires Oi^) space (and 0{m) time). The space required for the 
sketch of zjj is also 0{^) since we do not have to remember the random matrix used for the 
Johnson-Lindenstrauss lemma. The space requirement of this step is dominated by the space 
requirement for the sampled edges. 

The sampling and the construction of the sparsification can be done in 0{^) time. We solve 
ILPlOl in O(^) iterations. The minimum odd cut algorithm takes 0{n'^m') time as mentioned in the 
proof of Theorem [22] where m' is the number of edges. The number of edges in the sparsification cab 
be reduced to 0(^2-) by applying the sparsification algorithm again. So finding a minimum odd cut 

~ 3 

takes r = O(p-) time. Note that the time to construct sparsifications is dominated by the minimum 
odd cut algorithm. We need O(^logn) iterations of the violation oracle. So the algorithm takes 
O(^) time in total (for a particular guess of optimum solution a). Since we execute the algorithm 
for O(^) guesses of the optimal solution, we have additional ^ term in the space and time. □ 
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4.6 Space-Pass Tradeoffs and Theorem 1231 



The sparsification algorithm in [T] is based on samphng of edges. The samphng probabihty of an 
edge depends on the edge's weight and strong-connectivity where the strong-connectivity is defined 
as fohows. 

Definition 9. -A node-induced subgraph G' is k-strong connected if the size of a minimum cut 
in G' is k. An edge e is k-strong connected if e is in a k-strong connected component. 

If an edge is fc-strong connected and has weight y, its sampling probability is The 
proofs in [T, 1] hold even if we increase the sampling probability. Hence, if we know the upperbound 
of weight and the lowerbound of strong connectivity for every edge, we can process the sampling 
step first and construct the actual sparsification after the edge weights are fixed. This can be used 
for processing multiple iterations in one pass. 

Lemma 31. In AlgorithmUl x be the primal candidate in time t and x' be the primal candidate 
after the update, i.e., x' = (1 — cj)x + fjx. Let y and y' be the corresponding dual candidates. For 
any constraint, 

exp{-6/2) <^ < exp{6/2) 

Proof. Note that, y- = exp(KAj((TX — crx)/bj) = exp(K(TAj(x — x)/bj). By the definition of width 
parameter, —2p < Aj(x — x)/bj < 2p. Since a = we obtain the desired result. □ 

Therefore, the edge weight (and the strong connectivity) increases or decreases by at most 
exp(A:5/2) factor in k iterations which leads to at most exp{k6) factor increase in the sampling 
probability. For each pass, we sample k sets of edges which we will use to construct sparsifications 
for k iterations. The sampling probability is exp{k5) times the sampling probability in p]. Once 
we have such sampled edges, we can process k iterations without reading the data stream. Now we 
prove Theorem 1221 

Proof. (Of Theorem 1231) Since we increase the sampling probability by exp{k5), in the sparsifica- 
tion, it increases the size of sampled edge set by exp{k6) factor. Since we have k such sets, the 
space requirement for the sparsification is 0{k exp{k6)n/ 6'^) and the number of passes is 0(^|f^). 
By using k = and S = 0(e), and following the rest of the proof of Theorem 122^ we obtain the 
space and the number of passes. □ 

5 Estimating the Size of the Maximum Weighted Matching 

In this section, we estimate the size of the maximum weighted matching in general graphs by 
applying the algorithm in Section [J] to the maximum weighted matching. The dual linear program 
formulation for the maximum weighted matching is as follows: 

mill Y.;X.i + J2ul^^\zu 
s.t Xi + Xj + Y,ij^u > Wi] V(i, j) G E (LP12) 
> 

Edge weights introduce complications which prevent direct application of the fractional covering 
formulation and the corresponding oracle of Section [H In particular the proof of Corollarv 1241 does 
not apply to weighted graphs. We proceed in the following three steps. 
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1. We first run a filtering step to eliminate very small weights. This reduces the width of the 
inner step, or the violation oracle, such that we can solve the oracle in small space and small 
number of passes. 

2. We then run a second filtering step to ensure that the ratio of the largest to smallest weights 
incident to a vertex are bounded. This reduces the width of the outer step, or the overall 
fractional covering framework, such that the number of passes are bounded. 

3. We then slightly modify the LP formulation for the unweighted case in Section [H 
As a consequence we achieve the following result: 

Theorem 32. We can approximate the size of the maximum weighted matching in O(^) passes, 
6( " ^i^^ ) space, and 0{^) time. 

We present the three steps and the proof of Theorem [32] in the remainder of the section. We 
first show how to reduce the inner width (step 1). We then discuss the formulation we need to 
solve (step 3), and based on the formulation we show how to achieve the second preprocessing (in 
step 2) to reduce the width of the fractional covering framework. 

5.1 Preprocessing I: Filtering Small Weight Edges 

The width parameter of the violation oracle is 0{a/6) which is 0{n/6) for MCM. However, a could 
be arbitrarily large (compared to the minimum edge weight) for MWM. We modify the input graph 
so that a = 0{n/5) and the minimum edge weight is 1. 

We first execute an one-pass 0(l)-approximation algorithm on the graph and compute the 
lowerbound of the optimal solution. Let this value be A. We delete any edge whose weight is less 
than 6A/n. The weight of the optimal solution decreases by at most 6A. Since the a = 0{A) and 
the minimum edge weight is 6A/n, by scaling the edge weights, we obtain the desired graph. The 
width parameter of the violation oracle would be 0{n/6'^) compared to 0{n/6) in Section [4.31 

5.2 The Modified Fractional Covering Formulation 

Let $j = maxj Wij and = miuj Wij. We use the following fractional covering formulation. 



max A 

s.t Xi + Xj + Y^,j(,u > >^Wrj V(i, j) eE 



We first show that the above formulation is valid, i.e., it approximates the optimal solution of 
[LPT2I 

Lemma 33. Let OPT be the optimal solution for \LPl^ Then, there exists a feasible solution (with 
\ = 1) for \LPl^ whose objective value is at most (1 + 5)0PT. 

Proof. We start with an optimal solution (x, z) of lLP12l and construct a feasible solution for ILP131 
Let i be a node such that 2xi + Eie;/ -^(7 > For each U with i G [/, we increase Xj with 

j & U, j ^ i hy ^ and set zu to be zero. For i, we assign <I>j to Xj. For edges adjacent to i, the 
edge is satisfied by the value of Xj. For edges (i',j) not adjacent to i, we increase Xj' + Xj by zu 
while set zjj to be zero. So the constraint is still satisfied after the change. 

The process increase the objective value by ^j. But it decreases '^jj zjj by at least Since 
Ec/ is less than OPT initially, the total increase in the objective value is bounded by 50PT. 
Therefore, there exists a feasible solution for lLP13] whose objective value is at most {l+S)OPT. □ 
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5.3 Preprocessing II: Reducing the Width of ILP13] 

The width parameter of ILP13I is ^ maxj The parameter depends on the largest weight ratio 
between two edges that share an endpoint. Algorithm [9] finds a subgraph G' that contains a 
sufficiently large matching while the width parameter is bounded. The algorithm is a modification 
of Algorithm 7 in [2]. 

Algorithm 9 Finding a subgraph G' 

1: for each tier A: = 0, 1, • • • , L in parallel do 

2: Find a maximal matching. 

3: Let Ck be the set of nodes matched in the maximal matching. 

4: Let Si ^ Ck 

5: for t = 1 to g do 

6: Find a maximal matching between Ck and V — S\. 

7: Let T| be the set of nodes matched in the maximal matching. 

8: = 5* U Tl 

9: end for 

10: end for 

11: Let u\ = (1 + (5)*^ for the maximum k with i ^ S^. 

12: Return G" = {V,E') with E' = : ^Uj, < < Ui,Uj}. 



Lemma 34. The width parameter o f \LP13i is bounded by . 

Proof. Follows from inspection of line line [121 □ 

We now show that G' has a sufficiently large matching for a suitable setting of q. 

Lemma 35. Let OPT be the size of maximum weighted matching and q = 0{-^ log G' contains 
a matching of size (f — 0{5))0PT . 

Proof. First, we eliminate all edges (i, j) such that Wij > Ui or Wij > Uj and let G" be the resulting 
subgraph of G. G" contains a matching of weight at least (f — 0{5))0PT [2]. Now we fix an 

optimal solution in G" and eliminate all edges such that Wij < —Ui or Wij < ^uj. For each 

i G Cfc, we pick at most q neighbors in tier k which eliminates at most 5'^{1 + 6)^ weight. Let (i, j) 
be the edge matched in the initial maximal matching of tier k. Then, at least one of i and j must 
be matched to an edge with weight at least (1 + 6)^/2 in the optimal solution since otherwise we 
can improve the optimal solution by replacing the edge. Let this edge be e. We charge the weight 
of edges that are eliminated by i to e. Each edge e in the optimal solution can be charged at most 

We 

2 V y < 26We. 

Therefore, we obtain a matching of size at least (1 — 0{5))0PT. □ 
5.4 Proof of Theorem [32] 

The approximation ratio is guaranteed by the discussion in Section 15. f I and Lemmas [33] and [35] 

The width parameter is ^ log ^ based on Lemma [33] The rest of the algorithm and analysis is 
identical to Section [J] (except that the multiplicative weight update step has width n/S'^ instead of 
n/6). Thus we obtain the Theorem [321 
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